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In game theory, players have continuous expected payoff functions and can use fixed point theo- 
rems to locate equilibria. This optimization method requires that players adopt a particular type of 
probability measure space. Here, we introduce alternate probability measure spaces altering the di- 
mensionality, continuity, and differentiability properties of what are now the game's expected payoff 
functionals. Optimizing such functionals requires generalized variational and functional optimization 
methods to locate novel equilibria. These variational methods can reconcile game theoretic predic- 
tion and observed human behaviours, as we illustrate by resolving the chain store paradox. Our 
generalized optimization analysis has significant implications for economics, artificial intelligence, 
complex system theory, neurobiology, and biological evolution and development. 



u 
o 



(N 
> 



o 
o 



X 



I. INTRODUCTION 

In game theory, as formaUzed by von Neumann and 
Morgenstern [l| , Nash [^ ^j , and Kuhn |j] , rational play- 
ers with common knowledge of rationality (CKR) locate 
equilibria by using fixed point theorems to optimize con- 
tinuous expected payoff functions. These expected pay- 
off functions, according to probability measure theory 
|M 1^ Qi can only be defined after the adoption of a 
suitable probability measure space supporting appropri- 
ate random variables, functions, and probability distri- 
butions. For instance, mixed strategy probability mea- 
sure spaces were used by von Neumann and Morgenstern 
[l| and Nash 0, y|i while behavioural strategy proba- 
bility measure spaces were introduced by Kuhn |^. In 
addition, correlated strategy probability measure spaces 
were introduced by Aumann to model communication 
channels between players [g. In this last case, com- 
munications necessitate a change of probability measure 
space, however a change of probability space does not 
always require communication. Consequently, in this pa- 
per we introduce a method to analyze games using the 
infinite number of different probability measure spaces 
available to describe any given game and set of expected 
payoffs [a, S 13 ■ Our particular interest lies in the class 
of probability measure spaces which is consistent with 
the given game information constraints. That is, we con- 
sider only probability measure spaces which are consis- 
tent with rationality, CKR, and no communication chan- 
nels between players. Such probability measure spaces 
can exist, as we show later, simply because a number of 
different probability measure spaces are consistent with 
information flow via the game history set without any 
communication channels. In this paper, we suppose play- 
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ers may freely alter their choice of probability measure 
space among all those consistent with no communica- 
tions or any other alteration in the game, in contrast to, 
for instance, previous work on correlated equilibria |3]. 
For many games, a change in the underlying probabil- 
ity measure space will not affect equilibria — witness the 
equivalence of mixed and behavioural strategies in games 
of perfect recall |j|. However, in this paper, we argue 
that there exist games in which altering the choice of 
probability measure space will alter strategic equilibria. 
Assuming rationality, CKR, and the usual game infor- 
mation constraints, players can search an enlarged space 
of alternate probability measure spaces to optimize their 
expected payoffs, and thereby locate novel equilibria im- 
proving their outcomes over those achieved using only 
the conventional mixed or behavioural strategy probabil- 
ity spaces of game theory. 

In this paper, we assess for the first time whether 
the set of equilibria of any arbitrary game are entirely 
invariant under the altered mathematical parameteriza- 
tions defined by different probability measure spaces. It 
does appear that equilibria are indeed invariant under al- 
ternate probability measure spaces for single-player and 
multiple-player-single-stage games. However, equilibria 
are not invariant under altered choice of probability mea- 
sure space for multiple-player-multiple-stage games. In 
these games, the adoption of alternate probability mea- 
sure spaces by players can so alter the parameterized ex- 
pected payoff functions as to generate entirely novel sets 
of equilibria. 

Demonstrating this requires a significant generaliza- 
tion of the usual optimization methods of game the- 
ory. This is because alternate probability measure spaces 
and parameterizations can alter the functional form, di- 
mensionality, continuity and differentiability properties 
of what must now be treated as expected payoff func- 
tionals (not functions). As a result, the multiple-player 
calculus methods (essentially fixed point theorems) suit- 
able for expected payoff functions defined over contin- 



uous probability simplexes are insufRcient. To optimize 
expected payoff functionals, we must generalize the varia- 
tional and functional optimization techniques used in, for 
instance, general equilibrium and Cass-Koopmans style 
optimal growth analysis lij. [lOl. Ramsey-style multiple 
stage optimization |lll Il2l ll.'J I , and continuous time dif- 
ferential games [l^. Suitably generalized, these varia- 
tional and functional optimization techniques can recon- 
cile game theoretic prediction and observed human be- 
haviour as we illustrate using Selton's chain store para- 
dox [13 • In this game, backwards induction predicts 
that a monopolist never fights new market entrants even 
though, in practice, most monopolists will indeed fight 
new entrants and thereby improve their payoffs. This 
led Selton to conclude "mathematically trained persons 
recognize the logical validity of the induction argument, 
but t hey refuse to accept it as a guide to practical behav- 
ior." ^5|- This stark contrast makes this game a suitable 
vehicle for the presentation of our new methods. 



II. VARIATIONAL OPTIMIZATION OF 
PROBABILITY MEASURE SPACES 

We consider the general strategic optimization problem 
faced by two players X and Y seeking to maximize their 
respected expected payoffs (H^) and (11^') in a game 
where X chooses events x and Y chooses events y to 
generate respective payoff outcomes for each player of 
11"^ (a;, y) and n^(x, y). The chosen events x x y are con- 
tained in il^ X n^ , the set of all possible events in the 
game and in both player's chosen "roulette" randomiza- 
tion devices. These devices are used by players to avoid 
their choices being forecast and exploited, with the result 
that the choice of events is described using a joint prob- 
ability distribution P^^ ■ As is required in probability 
measure theory [a,|ajl3' ^^^ definition of this joint prob- 
ability distribution requires player X to adopt a proba- 
bility measure space V^ , and player Y to adopt a prob- 
ability measure space V^ , such that the joint product 
probability measure space V^ x V^ supports the prob- 
ability measure P^y^ ■ We allow players to vary their 
choice of probability measure space to maximize their 
expected payoffs. Altogether, the strategic optimization 
problem facing each player is 



X: max (n^^) = / dPE^ n-^(x. 



y) 



Y: n.ax (H^) = / dP^/ n^(x,y) 



(1) 



Here, expected payoffs for each player Z e {X, Y} are 
defined by a Lebesgue integral over all possible game 
and roulette events Q^ x fl^ of payoffs II^{x,y) re- 
sulting from particular game events {x, y) weighted by 
the joint probability measure of those events occurring 
P^^y ■ The optimization involves each player Z maximiz- 



ing their expected payoff over every possible joint prob- 
ability measure space that might be adopted V^ x V^ , 
where P^ = {fl^ , a^ , P^} is defined in terms of an ap- 
propriate event set f2^ modelling all game and roulette 
device events, a suitable sigma-algebra a^ , and an ap- 
propriate probability measure P^ . 

Game theory has not previously allowed rational play- 
ers to vary their choice of probability space to maxi- 
mize their expected payoffs. This is largely because von 
Neumann and Morgenstern's original goal was to formu- 
late strategic plans assessing every possible move in a 
game [l| , and they considered this goal required only that 
each player adopt a particular probability measure space 
defining mixed strategies in any game. (Kuhn later in- 
troduced alternate behavioural strategy probability mea- 
sure spaces providing an equivalent analysis in games of 
perfect recall Q.) While never stated explicitly, this re- 
striction essentially limits the search space of the players 
so they can only optimize over the probability parame- 
ters of a single type of probability space using fixed point 
theorems to locate Nash equilibria. In contrast, we ar- 
gue that, under CKR, players can search every alternate 
probability space consistent with game information con- 
straints by using generalized variational and functional 
optimization techniques. In the remainder of this section, 
we seek to explain heuristically why such a generalized 
analysis can generate novel and improved equilibria, and 
thus reconcile game theoretic prediction and observed hu- 
man behaviours. 

Alternate probability measure spaces can support dif- 
ferent equilibria in strategic situations as each adopted 
probability space can mathematically parameterize the 
same random event in very different ways. For example, 
consider a player X seeking to optimize a binary out- 
come specified by a random variable taking value x = 
with probability P'^(O) or x = 1 with probability P^(l). 
These probabilities can be characterized in terms of a sin- 
gle probability parameter p by tossing a biased coin, or 
in terms of five probability parameters {pi,P2,P3-PiiP5) 
say by using a biased dice. An alternative probabil- 
ity measure space might employ two sequentially tossed, 
independent, biased coins producing outcomes u = 1 
with probability p, while if m = then v = 1 with 
probability q and if w = 1 then v = 1 with probabil- 
ity r. The subsequent adoption of the random variable 
X = SuiSvi defines P^(l) = P{u = l,i; = 1) = pr. 
(Here, Sat = 1 if a = 6 and zero otherwise.) As a last il- 
lustration, consider a probability measure space in which 
the above two biased coins are now perfectly correlated 
via P{u,v) = P{u)P{v\u) = P{u)Suv = Piu). In this 
case, the known perfect correlation introduces a delta 
function to reduce the dimensionality of the joint distri- 
bution P{u,v) giving P'^ (!) = P{u = l,v = \) = p. In 
general, when parameterized using different probability 
measure spaces, a given probability possesses alternate 
functional forms with different dimensionality, correla- 
tion, continuity, and differentiability properties. 

This changeability of functional form and dimension- 



ality requires generalized variational and functional opti- 
mization methods be used to optimize strategic decisions. 
The generalized methods we develop extend the calcu- 
lus of variations which typically optimizes a functional 
^[/(2^)i /'(s;)] of known form, and where the functional 
F, the function f{x), and the gradient f'{x) have spec- 
ified differentiability properties. For instance, a short- 
est path problem seeks to optimize the known functional 



max /= / Jl + f'^dx. (2) 

fJ' Ja 

Similarly, the shortest time or Brachistochrone problem 

optimizes the known functional F[f{x), f'{x)] - 

via 



2s/ 



max / = 







■dx. 



(3) 



Lastly, a typical multiple stage Ramsey-style utility max- 
imization problem optimizes 



max / 



e-"F[/ - /']dx. 



(4) 



where now only the functional dependencies and 
certain differentiability properties of the functional 
F[f{x),f'{x)] are specified. To our knowledge, all appli- 
cations of the calculus of variations place severe restric- 
tions on the range of variation of the form of the func- 
tional being optimized, so much so that a problem with 
an entirely arbitrary functional would be considered ill 
defined. In contrast, in a strategic optimization problem, 
players are able to arbitrarily vary their choice of prob- 
ability measure space to alter all of the functional form, 
the dimensionality, and the continuity and differentiabil- 
ity properties of the functional being optimized. Heuris- 
tically, in single player terms, the optimization problem 
becomes 



i:v^Tf^dx 



max / = < 

fj' 



i:^dx 

j\---F[f ~ r]dx 



(5) 



That is, each player has the option of first choosing a pa- 
rameterizing probability measure space to alter the func- 
tional form, dimensionality, continuity and differentia- 
bility properties of the functionals being optimized, and 
only then to optimize the chosen functional over all possi- 
ble variations of f{x) and f'{x). More importantly, each 



of their choices affects their opponent's functionals, while 
at the same time, their opponent's decisions arc similarly 
altering their own functionals. 
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f(x)+Sf(x) 
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FIG. 1: The variational optimization of the functional 
F[f{x),f'(x)] requires the variation of both the function 
fix) -^ f(x) + Sf{x) and independently, its gradient f'{x) — > 
f'{x) + Sf'{x) over the region [a,b]. That is, through any point 
"A", every possible gradient must be considered in order to 
perform a complete variational analysis. 

We suggest that this variability of the strategic func- 
tionals means that optimization requires independent ex- 
amination of every possible functional, and every possible 
functional gradient, that might be defined by the play- 
ers. That is, we generalize the standard optimization 
algorithm of the calculus of variations in which func- 
tionals F[f{x),f'{x)] of known form are optimized by 
an independent variation of the function / and the gra- 
dient /'. This independent variation of each of the co- 
ordinates (/, /') over every possible value allows for in- 
stance, derivation of the Euler-Lagrange equations pro- 
viding the first order optimization conditions. This is 
depicted in Fig. ^ showing that every possible gradient 
and trajectory through any point "A" in the parameter 
space must be considered to locate optimal trajectories. 
Any restrictions on this search of all possible trajectories 
constrains the optimization. For instance, when players 
are restricted to using only a particular type of proba- 
bility measure space, i.e. mixed or behavioural strategy 
spaces, then expected payoff functions have fixed func- 
tional form, are continuous, and possess a single gradient 
at every point in the joint function space. These restric- 
tions allow use of the calculus (effectively fixed point the- 
orems) rather than a generalized calculus of variations to 
locate equilibria. 

We argue that, under CKR, players should potentially 
benefit from the ability to search an enlarged mathemat- 
ical space including many alternative joint probability 
measure spaces. A complete search of this enlarged math- 
ematical space requires that they examine not only every 
possible value of the expected payoff functions at every 
point in their parameter space, but also every possible 
gradient at every one of those points. In the follow- 
ing, we show that different probability measure spaces 



can associate different gradients with the same point in 
the joint expected payoff function space, and we argue 
that every such possibie gradient must be tafcen into 
account in any compiete variationai and functionai op- 
timization. That is, when piayers X and Y are seek- 
ing to optimize their respective expected payoffs (H"^) 
and (n^), they must examine not oniy every possibie 
pair of joint values ((11'^), (H^)) but also every possi- 

ble joint gradient (^, ^, . . . , ^, ^, . . .) 
evaluated with respect to every possible parameteriza- 
tion {pi,p2, ■ ■ ■ ,q_i,<l2, ■ ■ ■) defined in every possible joint 
probability measure space. 
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FIG. 2: A simple example two-player-two-stage perfect infor- 
m,ation game showing the different game decision trees specific 
to players adopting the respective joint probability spaces (a) 
Vif X P^ where optimal player choices are {x,y) = (0,1) 
generating expected payoffs ((11^), (11^)) — (2,2), and (b) 
V^ X VY where optimal player choices are {x,y) = (1,1) 
generating expected payoffs ({11'^), (11^)) = (4,3). 



III. VARIATIONAL OPTIMIZATION IN 
MULTIPLE STAGE GAMES 

In this section, we use a simple two-player-two-stage 
game to introduce standard mathematical methods that 
have not previously been applied in game analysis. Our 
goal is to demonstrate that even simple games can exhibit 
expected payoffs with multiple functional forms, multiple 
gradients and multiple trajectories at the same point in 
the parameter space necessitating use of variational op- 
timization methods. 

Suppose two players, denoted Z e {X,Y}, each seek 
to optimize their own outcomes in a strategic interaction, 
where in stage one X makes a choice of a; = or a; = 1 . 
In the second stage, player Y is aware of the opponent's 
previous choice and must also make a choice of y = or 



y — 1 at which point the game terminates and players 
obtain payoffs U^{x,y) as shown in Fig. [2Ia). 

Game theory analyzes this game by having each player 
adopt a joint probability space allowing the complete 
analysis of every possible choice that might be made 
in the game. In this game, both players suppose that 
X adopts a probability space V^ with random variable 
X e {0, 1} taking value x = 1 with probability p. In turn, 
both players suppose that player Y chooses a probability 
space allowing for any degree of correlation p between 
the observable game events x and y, that is, that these 
variables might be perfectly correlated p^y = 1, or per- 
fectly anti-correlated pxy = —1, or entirely uncorrelated 
Pxy — 0, or any value in between. Player Y does this by 
adopting the probability space V^ with random variables 
y^UjV G {0, 1} with u and v independent and taking val- 
ues u = 1 with probability q and v = I with probability 
r. The random variable y is functional determined to be 



giving 



P^(y|x) - <^ 



u if a: = 
V if a; = 1, 

• P^(0|0) = l-q 

p^{m = q 

P^(0|1) == 1-r 



(6) 



(7) 



I P^(l|l) 



As desired, this choice of probability space allows the 
players to examine every possible correlation state be- 
tween X and y defined as 

(xy) - {x){y) 



Pxyip,q,r) 



VWT~WVW)^W 

^p{l ~p){r - q) 



(8) 



Then, x and y are perfectly correlated at Pxyip, 0, 1) = 1, 
perfectly anti-correlated at Pxyip, 1,0) = —1, and uncor- 
related if either p = Oorlorg = r giving pxy = 0. 
As shown in Fig. |2Ia), in the joint probability space 
V^ X Vq , the expected payoff functions are 
1 

(n^ob,9,0) - ^P^(x)F^(2/|x)n^(x,2/) 

xy—0 

= 3-q+p{q + 3r-2) 

1 

xy=0 

= l + q~p{q + r^3), (9) 

so the gradients with respect to the three continuous de- 
pendent variables p, q and r are 

d{IlUP,q,r)) 



dp 



q + 3r-2 



(a) 



(b) 



(c) 



U^ = l+2p 












U^ = 2-p 
If =l+2p 




FIG. 3: (a) Game theory adopts a joint probability measure space "Pg ^ T-'o ^'^ which expected payoffs vary over three dimensions 
{p, q, r) and where positive gradients with respect to q and r (dotted arrows) and with respect to p (solid arrow) ensure that 
players maximize joint payoffs by choosing (p,q,r) = (0,1,0). (b) An alternate joint probability space Vq X Vi in which 
expected payoffs vary solely over a single dimension p with positive gradients with respect to p (solid arrow) ensuring that 
players optimize payoffs by choosing p — 1. (c) The choice of two alternate probability spaces (more are possible) associates two 
different total gradients (double-lined arrows) with any point along the perfect correlation line pxy = 1 at {q, r) = (0, 1) with 

I 00 P-i'"^ ^ 00 P'9''' ^ "'ar''''' ) ^ ( a^ ^ I . In the absence of any effective decision procedure privileging 

V / {g,r)^(0,l) V / 

any one space over another, players should examine all possible spaces, all possible gradients, and all possible trajectories. 



d{nUp,q,r)) 

dq 
d{UUp,q,r)) 

dr 



1-p 



-P- 



(10) 



As shown in Fig. Ota), this three-dimensional gradient 
exists and is non-zero even when x and y are perfectly 
correlated pxy = 1 at all points {q,r) = (0,1) so pay- 
offs are not optimized at these points. In fact, given 
the choice of probability space V^ x P^ , both play- 
ers conclude that Y maximizes their payoff by setting 
{q,r) = (1,0) while X maximizes their payoff by setting 
p = 0. The resulting move choices are {x, y) = (0, 1) gen- 
erating payoffs of ((n-^), (n^)) = (2, 2). This completes 
our analysis of the usually adopted joint probability mea- 
sure space, and we now turn to examine alternatives. 

In any game, alternate joint probability measure spaces 
exist with expected payoff functions of different func- 
tional form and different gradients at the same point in 
the parameter space. Suppose that player Y chooses a 
different probability space V^ in which they treat the 
observed value of the random variable a; as a coin toss de- 
termining their choice oi y — \ with probability p. That 
is, Y functionally assigns the random variable y to be 
perfectly correlated with the observed random variable x 
via 



y = X 

Oil 



This functional assignment does not require any com- 
munication between player X and Y . Then, in the joint 
probability space V^ x VY , the expected payoff functions 
straightforwardly equal 

1 

xi;— 

= 3+p 
1 
{nUp)) = 5]F^(2:)P^(y|x)n^(x,y) 

xy—0 

= l + 2p, (12) 

as seen in the decision tree of Fig. [Ub) , and in the ex- 
pected payoff function space of Fig. ^h). These ex- 
pected payoff functions are now dependent only on the 
single freely varying parameter p determining the gradi- 
ent with respect to p to be 



dp 



(13) 



P'{y\x) 



(11) 



Consequently, player X maximizes their payoff by setting 
p = 1 to choose a; = 1 leading F to set y = 1. Thus, in 
the joint probability space V^ x P^, player payoffs are 

((n^),(n^)) = (4,3). 

We now have two possible joint probability spaces; that 
normally adopted in game theory V^ x Vq and the novel 




Optimize via 
Calculus 



Optimize via functional and variational calculus 

FIG. 4: A schematic representation of a complete analysis of the simple game of Fig. [^ showing that each player must first 
decide which probability space to adopt. Here, players X and Y simultaneously choose their respective probability spaces Vi and 
Vj from among an infinite number of possibilities, where each choice generates a different decision tree for the game defining 
altered payoff functions. Players do not know their opponent's choice of probability space. Given the adoption of a particular 
joint probability space pf x Vj , expected payoff functions are known and continuous in terms of their input parameters so the 
calculus, suitably generalized for multiple-player strategic interactions, can be used to optimize payoffs. However, optimizing 
the choice of joint probability space involves expected payoff functionals requiring players to use variational calculus methods to 
optimize payoffs. 



V^ X VY . In these alternate spaces, the expected pay- 
off functions possess exactly the same value when x and 
y are perfectly correlated but possess entirely different 
gradients at this point — see Fig. Ol^c). Variational opti- 
mization principles insist that every possible functional 
form and gradient must be taken into account in any 
complete optimization. These principles permit players 
to infinitely vary the "immutable" functional assignments 
defining any space (i.e. y = S^qu+Sxiv and y = x above), 
providing access to a vastly larger decision space than 
usually analyzed in game theory. It is not a question of 
which space is best, rather, it is a question of either re- 
stricting the analysis to a single space or allowing players 
to analyze all possible spaces. 

Game theory adopts expected payoff "functions" 
(n^Q (p, g, r)) allowing examination of every possible com- 
bination of payoff values and assumes that this is suffi- 
cient for optimization. However, while these functions 
can duplicate every possible payoff value, they cannot 
duplicate every possible functional gradient — and opti- 
mization depends on gradients. When Y adopts a ran- 
domization device (a "roulette" ) which perfectly cor- 
relates X and y via the probability space V^ , then 
certainly (nfo(p, 0,1)) = (nf;^(p)), but these func- 
tions have different dimensionality and gradients. That 



IS, 



( a{Il^„{p,q,r)) 
\ dp 



a(n^„(p,g,r)) 

dq ' 



d{nZ„{p,q,r)) 
dr 



^ 



(g,r)-^(0,l. 

I ^ 01 IP)/ j _ Similar results apply for points at dif- 
ferent correlation values pxy] should Y adopt a ran- 
domization device where y is entirely uncorrelated 
with X via a new probability space 'PI', then cer- 
tainly (nfo(p,g,g)) = (nf2(p,g)) but these func- 



tions again have 
dients (^S^MMi. 



dp ' 



-^021 

different dimensionality and 

■)) d{^m(p-q-r)) d{nZ„(p.,q.r))- 



dq 



dr 



gra- 



(9SkA2M^ g'Uio2iP.9J; j _ These inequalities result as the 

usually adopted space Vq x Vq evaluates gradients using 
infinitesimals between points with different correlations 
so /\pxy = Pxy{p,q,r)-pxy{p + Sp,q + Sq,r + Sr) ^0. In 
contrast, when a roulette possesses a known correlation 



state as in the spaces Vq x "Pf or V^ 



X 



VI, 



then gra- 



dients are evaluated taking all constraints into account 
ensuring Ap^y = 0. Game analysis does not include every 
possible correlation constraint or every possible roulette, 
and taking these alternatives into account requires the 
variational methods presented in this paper. 

We suggest that the example game described in the 
two decision trees of Fig. |21 is best seen as having 
the schematic form shown in Fig. ^ in which play- 



X: 



n^n^- 




FIG. 5: An extended decision tree for the chain store paradox where player X adopts the usual probability space V^ with 
certainty while Y has a choice of three alternate probability spaces Vj for j £ { — , 0, +} selected with probabilities a, b, and c, 
and respectively denoting anti- correlated, independent, and correlated decision making. Game theory examines only the branch 
generated by the joint probability space V^ x Vq in which a potential new market entrant X must decide to either stay out of 
a new market a; = or enter the market x — 1 (with probability p), in which case the monopolist Y chooses to either acquiesce 
y = or fight their entry y = 1 (with probability r), with the corresponding payoffs shown. 



ers must first choose which probability space they will 
adopt, where this choice is unknown to their opponents 
at the commencement of the game, and must then op- 
timize their payoffs given the possible joint probabil- 
ity spaces that might be adopted. In such generalized 
trees incorporating choice of probability space, standard 
approaches can be applied to locate pure "variational" 
strategies, probabilistic "variational" mixed and "varia- 
tional" behavioural strategies, and "variational" equilib- 
ria. Of course, introducing "variational" mixed and be- 
havioural strategies means that players must introduce 
yet further probability spaces allowing the optimization 
of these probabilistic strategies. 

To provide a concrete illustration of our approach, we 
now show that rational players using variational opti- 
mization methods can resolve the chain store paradox. 



IV. RESOLVING THE CHAIN STORE 
PARADOX 

A minimal chain store paradox is shown as the central 
branch in Fig. generated by the adoption of the joint 
probability space V^ x Vq (defined below). This game is 
played over two sequential stages where first, a potential 
market entrant X must decide to either stay out of a new 
market a; = or enter that market x — 1. Their oppo- 
nent, the monopolist Y, observes this choice. Should no 
market entry occur, X neither gains nor loses any payoff 
while Y gains monopolist profits so (11"^, 11^) = (0,1). 



In contrast, should X enter the market, Y must then 
decide whether to acquiesce to their opponent's entry 
2/ = by leaving prices unchanged and sharing profits so 
(II'^jII^) = (1,0), or by driving X out of business by 
price cutting so payoffs are (11'^, 11^) = (—1, —1). 

A backwards induction analysis of the central branch 
of Fig. Oin isolation indicates that X will enter the mar- 
ket confident that the monopolist will not forego profits 
to fight their entry ;15J. Based on this, many economists 
argue it is irrational for monopolists to engage in preda- 
tory pricing to drive rivals out of business as predation is 
costly while potential new entrants well understand that 
price cuts are temporary and monopoly profits readily 
attract new market entrants ^iy|. Efforts to resolve the 
paradox include introducing multiple stages permitting 
reputation and deterrence effects [T^, as well as asym- 
metric information, mistakes, bounded rationality or im- 
perfect information and uncertainty J^ JX J^, JJ., 2^ ■ 
For a review, see [21j|. 

The decision and payoff combinations above define the 
general strategic optimization problem faced by the play- 
ers in the chain store paradox as 



X:max {H^) = P-^(l) [l - 2P^(1|1)] 



(14) 



F:max {W ) = 1 - P^ (1) - P^ {1)P'' {1\1). 

'pY 

Here, players alter their choice of joint probability space 
•pX ^ -pY g^g}^ that, within the selected optimal joint 
probability space, the optimization of their respective 



probability distributions P-^{x) and P^ {y\x) allows opti- 
mal choices {x, y) to be made so as to maximize respective 
payoflFs. 

A complete derivation of the "variational" equilibria 
of the extended tree of Fig. (S) is of course possible (and 
indicates that as long as Y sets c + br > i, most eas- 
ily achieved by choosing c = 1, then X maximizes their 
payoff through the choice p — 0). In this paper, we lo- 
cate pure variational equilibria in the chain store game. 
That is, we suppose that player X always chooses V^ 
(other choices are possible) while player Y chooses with 
certainty any of the three probability spaces Pj with 
j G {— , 0, -I-}. The interpretation of these latter spaces is 
that j ="— " indicates that y is perfectly anti-correlated 
to X, j ="0" indicates that y is entirely independent, and 
j ="-!-" indicates that y is perfectly correlated to x — see 
Fig.El 

First, we replicate the usual game analysis by suppos- 
ing that player X adopts a probability space V^ with 
random variable x G {0, 1} such that x — 1 with proba- 
bility p, while player Y adopts the probability space V^ 
with random variables y,u,v G {0,1} such that u and 
V are independent random variables taking value u ~ 1 
with probability q and v = 1 with probability r, and 
where the random variable y is functionally assigned as 
y — Sxou + SxiV- Altogether, this gives the same proba- 
bility parameterization for P^ {y\x) as appears in Eq. 
In the joint probability space V^ x V^ , the optimization 
problem reduces to 

X: max {U^q) 



-P 



P^(1)[1-2P^(1|1)] 
= p(l - 2r) 

F:max «„) = 1 - P^(l) - P^(1)P^(1|1) 

r 

= 1 — p — pr^ 



(15) 



so the gradients with respect to the two continuous de- 
pendent variables p and r are 



5« 
dp 

g(ngo) 

dr 



l-2r 



'P- 



(16) 



Essentially then, the monopolist Y maximizes their ex- 
pected payoff by setting r — and always acquiesces to 
new market entrants, while X maximizes their payoff by 
choosing p = 1 and so always decides to enter the mar- 
ket. The resulting expected payoffs given that players 
adopt this sole perfect Nash equilibria of {p,r) = (1,0) 
are((n^),(n^))-(l,0). 

Suppose however that players X and Y choose the joint 
probability space V^ x V\ (the rightmost branch of Fig. 
Is)) where y is perfectly correlated with x via the functional 
assignment y = x and P{y\x) = 5yx altering the strategic 
optimization problem to 



X: max (n^+} 



P^{\) [l-2P^(l|l)] 



(17) 



r:«+) = 1-P^(1)-P^(1)P^(1|1) 
= l-2p. 

These expected payoff functions are continuous over the 
single freely varying parameter p giving the gradient 



g(n.f+) 

dp 



1 <0 



(18) 



ensuring that player X maximizes their expected payoff 
by setting p = Q and not entering the market. That is, 
when players (X, Y) adopt the V^ x V\ joint probability 
space, they maximize their payoffs via the combination 
{x,y) = (0,0) to garner payoffs {{li^), {IV^)) = (0, 1). 

Alternatively, in the anti-correlated joint probability 
space V^ x VY. (the leftmost branch of Fig. EJ, y is 
perfectly anti-correlated with x via y = (1 — x) and 
P{y\x) — 5y(i^x) giving the altered strategic optimiza- 
tion problem of 



X: max (n^_) 
p 



P-^(l) [1-2P^(1|1)] 
P 



(19) 



Y:{n^_) = 1-P-^(1)-P-^(1)P^(1|1) 
= l-P- 

Again, these are functions of the sole parameter p giving 
the gradients 



d{U^ 



X 



dp 



= 1 >0, 



(20) 



ensuring that player X sets p = 1 and chooses to en- 
ter the market. The result is that when players {X, Y) 
adopt the V^ x VY^ joint probability space, they maxi- 
mize their payoffs via the combination {x^y) — (1,0) to 
garner payoffs ((H^), (H^)) = (1,0). 

Altogether, when players consider only pure varia- 
tional strategies (specifying probability spaces and move 
choices), the various payoffs available are 



((n^>,(n^» 


v^ 


pY 




(1,0) 


V}; 




(1,0) 


VI 




(0,1), 



(21) 



making it evident that to maximize their expected pay- 
off, player Y must rationally elect to use probability space 
V\ in preference to either VY, or VY- That is, Y will un- 
dertake to functionally correlate their move to the previ- 
ous choice of the potential market entrant, and thereby 
deny themselves a choice about the setting of y once the 



game has commenced. In the probabiHty space P^, the 
optimization by player Y has no second stage component 
as the joint probabihty distributions are inseparable, and 
an opportunity for a second stage optimization exist only 
in the space V^ . Player Y foregoes a choice during the 
game itself knowing this to be payoff maximizing. Player 
X, being aware of this will not enter the market as in the 
minimal chain store game described by V^ x V^, en- 
try automatically invokes retaliation. We thus reconcile 
game theoretic prediction and observed human behaviour 
implying human players generally commence a strategic 
analysis by first optimizing their choice of probability 
space and only subsequently optimizing the probability 
distributions defined by that space. 

It is of course possible to consider a broader range of 
joint probability spaces for both players X and Y, though 
this will not substantially alter the conclusion here that 
it can be rational for a monopolist to punish market en- 
trants to resolve the chain store paradox. 



V. CONCLUSION 

This paper locates strategic equilibria using general- 
ized calculus of variation techniques and is thus consis- 
tent with, and extends, the more usual methods of game 
theory based on the fixed point theorems of the calculus 
[22. I23 . We hold that under CKR, players might often 
improve their outcomes by expanding their mathematical 
search space to include alternate probability spaces. Con- 
sequently, we allow players to first optimize their choices 
of probability measure space which alters both the ex- 
pected payoff functionals (not functions) and the joint 
probability distributions specifying move choices to lo- 
cate "variational" equilibria. Generally, these variational 
equilibria differ from Nash equilibria even in perfect infor- 
mation games such as the chain store paradox considered 
here. This is because first, players are uncertain about 
which joint probability measure space is in play, and sec- 
ond, each alternative space introduces different correla- 
tions rendering the joint probability distribution insep- 
arable and altering allowable subgame decompositions 
and the backwards induction analysis. We show that 
when rational players variationally optimize their choice 
of probability measure space to access "variational" equi- 
libria, then this can reconcile game theoretic prediction 
and observed human behaviour. To illustrate this, we 
demonstrated that our general variational and functional 
optimization approaches resolve the chain store paradox. 
This strongly suggests that rational players should, in 
fact, exploit unrestricted optimization in general. 

More generally, we suggest that selfish "homo eco- 
nomicus" might exploit variational optimization to ac- 
cess alternate probability measure spaces to exhibit al- 
truistic or cooperative behaviour whenever that is payoff- 
maximizing. This might explain for instance, the efficacy 
of state led development processes [2J, [23 and the indus- 
try wide correlations of the "Just-In-Time" Toyota pro- 



duction system [23, |27|. Consequently, variational op- 
timization may change the orientation and methods of 
evolutionary game theory |28| and quantum game the- 
ory [23, among other fields, while the need to search 
infinite numbers of joint probability spaces will reinforce 
the importance of learning, a principal feature of evolu- 
tionary economics 30]. Similarly, variational optimiza- 
tion will impact "selfish gene" theory which presently 
holds that genes optimize their fitness independently so 
altruism is explained by relatedness and the likelihood 
of shared genes ^|. In contrast, we suggest that mod- 
elling the evolution of emergent hierarchical complexity 
in living organisms requires taking account of alterna- 
tive probability spaces correlating system components; 
correlated entities together constitute an indivisible unit 
which must be optimized as a whole. Consequently, com- 
plex multicellular eukaryotes might well have optimized 
fitness by adopting correlating signals to multitask their 
dynamics by exploring alternate dynamical decision trees 
(organismal probability spaces) most likely by expansion 
of their RNA signaling capabilities |33 . \33L |3J| . Similar 
considerations mean that neural networks can endoge- 
nously modify correlations among their components to 
explore an infinite number of alternate dynamical trees 
to implement complex cognition. Game analysis also un- 
derlies the tree search "minimax" algorithms of artificial 
intelligence |3a, jSy, |33 which typically fail to emulate 
human intelligence. In chess playing, for instance, ex- 
pert human players typically employ pattern recognition 
and "chunking" jSg], and appear to be exploiting the 
same correlation information that underpins variational 
optimization. This is consistent with the "social intelli- 
gence" explanation for the runaway evolution of primate 
intelligence where individuals dynamically realign their 
strategic partnerships to correlate behaviours to optimize 
outcomes in competitive group settings |39ll40l| . 

It has long been thought that any strategic opti- 
mization problem was essentially equivalent to a possi- 
bly greatly enlarged non-strategic optimization problem. 
This equivalence arises as each player can introduce suf- 
ficient new variables to fully model all of the possible 
actions of all of their opponents. As a result, strate- 
gic optimization has been thought to be of equivalent 
complexity to, for instance, non-strategic physics opti- 
mization problems, and solved by similar methods such 
as the calculus of variations or the calculus. Some have 
argued the converse. A perceived fundamental incom- 
patibility between physics and biological complexity mo- 
tivated Mayr to claim that biology is an autonomous sci- 
ence rather than a subbranch of the physical sciences [4l| , 
with the factor missing in physics but present in biology 
being identified as "entailment" (essentially correlation) 
by Rosen |42| , while it has been unconventionally argued 
that information science is incomplete and that it is our 
growing understanding of genomic programming and bi- 
ological complexity that will contribute significant new 
insights in this field (J. S. Mattick, personal communi- 
cation). Variational optimization might well help close 



these perceived gaps. 
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